A Deep Diacritics-Based Recognition Model for Arabic Speech: Quranic Verses as Case Study

نویسندگان

چکیده

Arabic is the language of more than 422 million world’s population. Although classic Quran that 1.9 billion Muslims are required to recite, limited speech recognition exists. In Arabic, diacritics affect pronunciation a word, change in diacritic can meaning word. However, most Arabic-based models discarded diacritics. This work aims recognize while considering by converting audio signals diacritized text using Deep Neural Network (DNN)-based models. The DNN-based model recognizes DNN which outperformed traditional systems’ phonetics dependency. Three were developed speech: (i) Time Delay Network-Connectionist Temporal Classification (CTC), (ii) Recurrent (RNN)-CTC, and (iii) transformer. A 100hours dataset recordings has been used. Based on results, RNN-CTC obtained state-of-the-art results with lowest word error rate 19.43% 3.51% character rate. recognized character-by-character reliable compared transformers’ whole-sentence behaviour. performed well clear unstressed short sentences. Moreover, effectively out-of-the-dataset sounds. findings recommend continuing efforts enhancing diacritics-based obtain better performance. pretraining large could accurate recognition. outcomes be used enhance existing solutions supporting

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Arabic font recognition based on diacritics features

Many methods have been proposed for Arabic font recognition, but none of them has considered the specialty of the Arabic writing system. Most of these methods are either general pattern recognition approaches or application of other methods which have been developed for languages other than Arabic. Therefore, this paper is the first attempt to present an alternative method for Arabic font recog...

متن کامل

Automatic Arabic diacritics restoration based on deep nets

In this paper, Arabic diacritics restoration problem is tackled under the deep learning framework presenting Confused Subset Resolution (CSR) method to improve the classification accuracy, in addition to Arabic Part-of-Speech (PoS) tagging framework using deep neural nets. Special focus is given to syntactic diacritization, which still suffer low accuracy as indicated by related works. Evaluati...

متن کامل

Speech Emotion Recognition Using Scalogram Based Deep Structure

Speech Emotion Recognition (SER) is an important part of speech-based Human-Computer Interface (HCI) applications. Previous SER methods rely on the extraction of features and training an appropriate classifier. However, most of those features can be affected by emotionally irrelevant factors such as gender, speaking styles and environment. Here, an SER method has been proposed based on a concat...

متن کامل

A Pilot PropBank Annotation for Quranic Arabic

The Quran is a significant religious text written in a unique literary style, close to very poetic language in nature. Accordingly it is significantly richer and more complex than the newswire style used in the previously released Arabic PropBank (Zaghouani et al., 2010; Diab et al., 2008). We present preliminary work on the creation of a unique Arabic proposition repository for Quranic Arabic....

متن کامل

Maximum Entropy Based Restoration of Arabic Diacritics

Short vowels and other diacritics are not part of written Arabic scripts. Exceptions are made for important political and religious texts and in scripts for beginning students of Arabic. Script without diacritics have considerable ambiguity because many words with different diacritic patterns appear identical in a diacritic-less setting. We propose in this paper a maximum entropy approach for r...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Access

سال: 2023

ISSN: ['2169-3536']

DOI: https://doi.org/10.1109/access.2023.3300972